AITopics | French Southern Territories

Collaborating Authors

French Southern Territories

Evaluating Large Language Models for IUCN Red List Species Information

arXiv.org Artificial IntelligenceOct-6-2025

Large Language Models (LLMs) are rapidly being adopted in conservation to address the biodiversity crisis, yet their reliability for species evaluation is uncertain. This study systematically validates five leading models on 21,955 species across four core IUCN Red List assessment components: taxonomy, conservation status, distribution, and threats. A critical paradox was revealed: models excelled at taxonomic classification (94.9%) but consistently failed at conservation reasoning (27.2% for status assessment). This knowledge-reasoning gap, evident across all models, suggests inherent architectural constraints, not just data limitations. Furthermore, models exhibited systematic biases favoring charismatic vertebrates, potentially amplifying existing conservation inequities. These findings delineate clear boundaries for responsible LLM deployment: they are powerful tools for information retrieval but require human oversight for judgment-based decisions. A hybrid approach is recommended, where LLMs augment expert capacity while human experts retain sole authority over risk assessment and policy.

accuracy, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2510.0283

Country:

Africa > Saint Helena, Ascension and Tristan da Cunha (0.28)
North America > United States (0.14)
Oceania > Australia > Australian Indian Ocean Territories > Territory of Cocos (Keeling) Islands (0.14)
(236 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.70)

Industry: Information Technology > Security & Privacy (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)

Add feedback

MIRAI: Evaluating LLM Agents for Event Forecasting

Ye, Chenchen, Hu, Ziniu, Deng, Yihe, Huang, Zijie, Ma, Mingyu Derek, Zhu, Yanqiao, Wang, Wei

arXiv.org Artificial IntelligenceJul-1-2024

Recent advancements in Large Language Models (LLMs) have empowered LLM agents to autonomously collect world information, over which to conduct reasoning to solve complex problems. Given this capability, increasing interests have been put into employing LLM agents for predicting international events, which can influence decision-making and shape policy development on an international scale. Despite such a growing interest, there is a lack of a rigorous benchmark of LLM agents' forecasting capability and reliability. To address this gap, we introduce MIRAI, a novel benchmark designed to systematically evaluate LLM agents as temporal forecasters in the context of international events. Our benchmark features an agentic environment with tools for accessing an extensive database of historical, structured events and textual news articles. We refine the GDELT event database with careful cleaning and parsing to curate a series of relational prediction tasks with varying forecasting horizons, assessing LLM agents' abilities from short-term to long-term forecasting. We further implement APIs to enable LLM agents to utilize different tools via a code-based interface. In summary, MIRAI comprehensively evaluates the agents' capabilities in three dimensions: 1) autonomously source and integrate critical information from large global databases; 2) write codes using domain-specific APIs and libraries for tool-use; and 3) jointly reason over historical knowledge from diverse formats and time to accurately predict future events. Through comprehensive benchmarking, we aim to establish a reliable framework for assessing the capabilities of LLM agents in forecasting international events, thereby contributing to the development of more accurate and trustworthy models for international relation analysis.

cameocode, isocode, relation, (15 more...)

arXiv.org Artificial Intelligence

2407.01231

Country:

Asia > North Korea (0.14)
Oceania > Australia > Australian Indian Ocean Territories > Territory of Cocos (Keeling) Islands (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
(234 more...)

Genre: Research Report > New Finding (0.45)

Industry:

Law (1.00)
Government > Foreign Policy (1.00)
Government > Military (0.93)
Information Technology (0.92)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Graph-based Virtual Sensing from Sparse and Partial Multivariate Observations

De Felice, Giovanni, Cini, Andrea, Zambon, Daniele, Gusev, Vladimir V., Alippi, Cesare

arXiv.org Artificial IntelligenceFeb-19-2024

Virtual sensing techniques allow for inferring signals at new unmonitored locations by exploiting spatio-temporal measurements coming from physical sensors at different locations. However, as the sensor coverage becomes sparse due to costs or other constraints, physical proximity cannot be used to support interpolation. In this paper, we overcome this challenge by leveraging dependencies between the target variable and a set of correlated variables (covariates) that can frequently be associated with each location of interest. From this viewpoint, covariates provide partial observability, and the problem consists of inferring values for unobserved channels by exploiting observations at other locations to learn how such variables can correlate. We introduce a novel graph-based methodology to exploit such relationships and design a graph deep learning architecture, named GgNet, implementing the framework. The proposed approach relies on propagating information over a nested graph structure that is used to learn dependencies between variables as well as locations. GgNet is extensively evaluated under different virtual sensing scenarios, demonstrating higher reconstruction accuracy compared to the state-of-the-art.

conference paper, dataset, dependency, (17 more...)

arXiv.org Artificial Intelligence

2402.12598

Country:

North America > United States (0.67)
Indian Ocean (0.04)
Antarctica > French Southern Territories > Kerguelen > Port-aux-Francais (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report (0.50)

Industry:

Energy > Renewable > Solar (0.69)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

AI Distinguishes Cancer Cells From Healthy Ones - AI Summary

#artificialintelligenceJun-14-2022, 07:42:56 GMT

A machine learning program has been used to distinguish between cancerous and healthy cells by identifying patterns in characteristic combinations of genes.

ai distinguish cancer cell, machine learning, republic, (12 more...)

#artificialintelligence

Country:

Oceania > Australia > Australian Indian Ocean Territories > Territory of Cocos (Keeling) Islands (0.18)
Oceania > Samoa (0.08)
Europe > Netherlands (0.08)
(228 more...)

Industry: Health & Medicine > Therapeutic Area > Oncology (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

Python Computer Vision Course

#artificialintelligenceJun-27-2020, 01:16:43 GMT

Learn Computer Vision. Introduction course to Computer Vision with Python. Make Computer Vision Apps? Learn Computer Vision theory? Build a strong portfolio with Computer Vision & Image Processing Projects? Looking to add Computer Vision algorithms in your current software project ? Whatever be your motivation to learn Computer Vision, I can assure you that you’ve come to the right course. You get. Complete course with 1 hour of video tutorials, Source code for all examples in the course. What you'll learn. Use basic Computer Vision techniques. Do image processing. Build: Image Similarity app, Face Detection app and Object Detection app! Master Computer Vision! .

artificial intelligence, python computer vision course, republic, (6 more...)

#artificialintelligence

Country:

Africa > Saint Helena, Ascension and Tristan da Cunha (0.32)
Oceania > Australia > Australian Indian Ocean Territories > Territory of Cocos (Keeling) Islands (0.17)
Asia > Laos (0.17)
(220 more...)

Technology: Information Technology > Artificial Intelligence > Vision (1.00)

Add feedback

AI/ML Bootcamp

#artificialintelligenceNov-9-2019, 15:56:40 GMT

By registering, you agree to the AWS Event Terms and Conditions and the AWS Community Codes of Conduct. By completing this form, I agree that I'd like to receive information from Amazon Web Services, Inc. and its affiliates related to AWS services, events and special offers, and my AWS needs by email and post. You may unsubscribe at any time by following the instructions in the communications received. By completing this form, I agree that I'd like to receive information from Amazon Web Services, Inc. and its affiliates related to AWS services, events and special offers, and my AWS needs by email and post. You may unsubscribe at any time by following the instructions in the communications received.

ai ml bootcamp, event and special offer, information, (12 more...)

#artificialintelligence

Country:

Oceania > Australia > Australian Indian Ocean Territories > Territory of Cocos (Keeling) Islands (0.20)
South America > Falkland Islands (0.07)
Oceania > Northern Mariana Islands (0.07)
(23 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

AI For Marketers: An Introduction and Primer, Second Edition

#artificialintelligenceOct-25-2019, 23:19:14 GMT

Keep on file Card Number We do not keep any of your sensitive credit card information on file with us unless you ask us to after this purchase is complete. Your rental will be available for 30 days. Once started, you'll have 72 hours to watch it as much as you'd like! You'll need an account to access this in our app. Please create a password to continue. You agree to our Terms Of Use.

app store, introduction and primer, republic, (9 more...)

#artificialintelligence

Country:

Africa > Saint Helena, Ascension and Tristan da Cunha (0.31)
Oceania > Australia > Australian Indian Ocean Territories > Territory of Cocos (Keeling) Islands (0.16)
Asia > Laos (0.16)
(220 more...)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

bcr vidcast 107: AI governance, what are AI and ML, and the future is not here yet - Better Communication Results

#artificialintelligenceApr-15-2019, 02:44:07 GMT

Vikram Mahidhar reminds us all that AI is only as good as the humans supervising it and programming it. The biases and artefacts that come out of the processing are reflective of the biases programmed in at the beginning. A program trained to recognise totalled car bodies for insurance purposes, for example, will need close supervision of its decision-making outputs, for regulatory and consumer confidence and acceptance of the decision. There is a call and a growth in a new class of AI--one that is explainable, and that builds trust by providing evidence. Vikram also reminds us that a governance strategy is key to engendering trust in our organisation, processes and people.

ai and ml, artificial intelligence, natural language, (15 more...)

#artificialintelligence

Country:

Oceania > Samoa (0.06)
North America > United States (0.06)
South America > Venezuela (0.05)
(233 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.49)

Add feedback